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AMINO ACID-ENRICHED PLANT PROTEIN RESERVE^, IN 
PARTICULAR LYSINE-ENRICHED MAIZE y-ZEIN, AND PLANTS 
EXPRESSING SUCH PROTEINS 

The present application relates to novel means enabling plants expressing 
5 protein reserves which are enriched in amino acids and which are j deficient in 
normal protein reserves to be prepared, in particular to lysine-enricihed protein 
reserves. The invention also provides the thus modified protein reserves, and 
plants expressing these modified protein reserves. 

Many plants, some after transformation using physico-chemical steps, are 

10 of major economic importance for human or animal foodstuffs, and th^ problem of 

i 

improving their nutritional quality has already given rise to different types of 

research. In particular, to overcome the insufficiency of certain amino acids in 

i 

plant protein reserves, selected varieties have been developed which have superior 

i 

nutritional qualities, or different modifications have been proposed! which use 
15 genetic engineering techniques to encourage or increase production inj such plants 
of certain deficient amino acids which are nevertheless important for the nutritional 

! 

qualities of the plant. Examples of deficient amino acids are lysine and methionine. 

Within the context of the present application, the inventors haye proposed 
an original solution to the problem of improving plants, in particular improving 
20 their nutritional qualities, in the first instance using a plant of considerable 
economic importance, namely maize. More precisely, they have concentrated on 
the protein reserves in maize seed endosperm, which comprise zeins, ib particular 
y-zein. 

As maize seeds develop, the cells of the endosperm synthesise large 
25 quantities of protein reserves, in particular a-, B- and y-zeins. Such zeins are 
accumulated in protein bodies derived from the endoplasmic reticulum (ER). 

In general, zeins represent a complex protein group divided intjo a number 
of groups, a-, 6-, y- and 5-zeins (Larkins et al, 1989) encoded by a| multigenic 
family (Hagen and Rubenstein, 1980, Gene 13, 239-249). While their jstructure is 
30 variable, such proteins have common features: the presence of tandem repeats in 
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their primary structure which are rich in proline type amino acid residues, the 
presence of numerous hydrophobic residues which result in the insolublility of such 
proteins in aqueous media, and the absence of lysine residues, essential amino acids 
for man and for monogastric animals. The absence of lysine in all of the major 
5 proteins (detected in large quantities in the endosperm) naturally produced in the 
zein group leads to an unbalanced amino acid composition in maize seeds. 

Of such proteins, maize y-zein is a protein with a molecular weight of 28 
kDa, the coding sequence for which has been described in the cDNA form by Prat 
et al (Nucleic Acids Research, vol. 13, n° 5, 1985, p 1494-1504). The complete 
10 sequence of the gene coding for the y-zein, including the upstream and downstream 
non coding sequences containing the expression regulation elements, has been 
described by M. Reina et al. (Nucleic Acids Research, vol. 18, n° 21, 1990, p 
6426). 

Up until now, different approaches have been envisaged for increasing the 

15 amount of lysine in proteins of the zein group. In this respect, genetic and 
molecular approaches have been carried out. As an example, mutants for obtaining 
lysine-rich maize such as opaque-2 mutant (o2) and floury-2 mutant (fl-2) (Mertz 
et al., 1964, Science 145, 279-280, Nelson et al., 1965, Science 150, 1469-1470) 
have been proposed and attempts have been made to remedy the deleterious effects 

20 of the absence of certain classes of zeins, in particular a-zeins, on the phenotype 
characteristics by selecting maize containing o2modifying genes (Paez et al., 1969, 
Plant Sci. 9, 251-252, Geetha et al, 1991, Plant Cell 3, 1207-1219). 

Another approach has consisted in taking indirect action on the production 
of free lysine, in particular in dicotyledonous plants. That technique involved 

25 deregulating key enzymes (DHTPS and AK) involved in the lysine via aspartate 
biosynthesis cycle. A cross sensitive to the levels of free lysine was obtained in the 
leaves, but not in the seeds, in tobacco plant transformation experiments with R 
Coli bacteria containing dapA genes and E. Coli bacteria containing the lysC gene 
(Shaul and Galili, 1992, Plant J. 2, 203-209 and 1993, Plant Mol. Biol 23, 759- 

30 768; Perl. A., Schaul O., Galili. G., 1992, Plant Molecular Biology 19, p 815-823). 



Recently, the same genes, dapA from Corvnebacterium and lysC from E. Colt 
were used and expressed under the control of a specific promoter of seeds in soya 
plants. Expression of these two enzymes in soya led to a five times increase in the 
amount of lysine in the seeds (Falco et al., 1995, BIO-Technology 13, 577-582). 

Other authors (Wallace et al., 1988, Science 240, 662-664) attempted to 
increase the lysine in a-zein (19 kDa) in maize seeds by point incorporation of 
lysine residues at different positions in the a-zein molecule. Expression of these 
constructs in Xenopus oocytes led to proper assembly of lysine-rich zeins in 
analogous vesicles of protein bodies. However, the normal a-zein and the lysine- 
enriched modified form were degraded when they were expressed in tobacco seeds 
(Othani et al., 1991, Plant Mol. Biol, 16: 117). 

Thus there is currently no knowledge regarding the means which could 
enable expression of a lysine-enriched zein in cells producing it naturally in maize, 
i.e., in the endosperm cells. A fortiori, expression of lysine-enriched zeins in other 
plant cells has not been mastered. 

One aim of the invention is thus to provide means for obtaining a lysine- 
enriched zein, in particular a lysine-enriched maize y-zein, this protein being 
expressed particularly in maize seed cells and in particular in endosperm cells, said 
modified protein further being expressed such that its properties as regards 
localisation and accumulation in the endoplasmic reticulum and derivative protein 
bodies are preserved. 

The expression "lysine-enriched" used in the present application means that 
the protein includes an increased number of lysine residues with respect to the 
natural protein from which it is derived, for example as a result of modifying the 
nucleotide sequence expressing it. 

The invention also provides means for obtaining expression of proteins, 
preferably lysine-enriched y-zeins, in plant cells of different tissues, for example 
leaf tissue or root tissue, and if necessary in the cells of plants which do not 
naturally express the protein, in particular the y-zein the production of which is 
desired. 



In addition, in one particular implementation of the invention, other protein 
reserves can be enriched in lysine under analogous conditions. 

In a first aspect, the inventors propose to introduce into the gene coding for 
the y-zein or for other protein reserves of maize or other plants, or into the coding 
sequence of this gene, sequences coding for lysine-enriched polypeptides, in order 
to produce lysine-enriched y-zeins or other proteins and thus to produce lysine- 
enriched seeds. Different sites in the coding sequence of the y-zein gene have been 
identified as allowable sites (also known as neutral sites) to prepare the modified 
nucleotide sequences. 

The present application thus proposes means for transforming the gene 
coding for the maize y-zein or for transforming any nucleotide sequence coding for 
the y-zein and derived from that gene, so as to obtain, by expression of the 
modified gene or, more generally, of the modified nucleotide sequence, a lysine- 
enriched protein; these means in particular include synthetic oligonucleotides 
coding for an amino acid sequence comprising lysine residues. 

The invention also provides recombinant nucleotide sequences or chimeral 
sequences which can code for a lysine-enriched y-zein. 

Still further, the invention provides host cells transformed by such 
sequences, in particular plant cells, for example cells enabling plant regeneration, 
also plants or plant portions (tissues, organs...) containing such cells and 
producing modified protein reserves in a stable manner, in particular lysine- 
enriched y-zeins. 

The invention also encompasses said modified proteins, for example lysine- 
enriched proteins, and antibodies directed against these proteins. 

An appropriate oligonucleotide for carrying out the invention for use in 
preparing recombinant nucleotide sequences is characterized in that it comprises at 
least one concatenation coding for a polypeptide with formula (P-K) n , where: 

• n is a whole number of 2 or more; 

• P represents a proline amino acid residue; 

• K represents a lysine amino acid residue; 
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• the symbol represents a bond between the two amino acid residues, in 
particular a peptide type bond, the n (P-K) units also being bonded together by 
such bonds, for example peptide type bonds. 

In a first embodiment, an oligonucleotide of the invention is thus 
5 characterized in that it comprises a sequence coding for a series of repeated 
moieties comprising two amino acids. 

The oligonucleotide codons may be identical for all of the proline and/or 
for all of the lysine residues. They may also be different for the same amino acid 
residue, the variation taking the degeneracy of the genetic code into account. 
10 This oligonucleotide is preferably formed by a sequence coding for more 

than 2 (P-K) units. Preferably, n is 30 or less, in particular below 20 and 
advantageously, n equals 4, 5, 6, 7, 8, 9 or 10, or 15. 

The "oligonucleotides" of the invention can be chemically synthesised using 
any available technique. 
15 The term "polypeptide" referring to the concatenation (P-K) n as used in the 

present invention means a sequence of amino acids containing more than 2 amino 
acid residues and which may comprise up to 60 amino acid residues. 

In a first variation of the invention, the oligonucleotide comprises several 
concatenations coding for a polypeptide with formula (P-K) n , identical or different, 
20 associated in tandem. 

These oligonucleotides are either repeats of a single concatenation, or 
associations of different concatenations. The number of associated concatenations 
can vary, for example in the range 2 to 10 concatenations. 

In a further variation of the invention, the oligonucleotide defined above is 
25 characterized in that it comprises at least one concatenation coding for a 
polypeptide with formula (P-K) n , in which the sequence of n (P-K) units is 
interrupted by one or more amino acid residues other than P or K residues. 

The supplemental amino acids incorporated into the sequence formed by 
the (P-K) units are preferably selected so as not to modify the organisation of the 
30 polypeptide coded by the oligonucleotide, or at the very least not to cause 
interaction with the amino acids of a protein into which said polypeptide would be 
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incorporated, under conditions which would affect the structure and/or function 
and/or localisation of this protein. 

This can in particular be the case when the number of (P-K) units is high or 
when several concatenations formed from sequences coding for (P-K) n moieties are 
5 associated in tandem and when the preparation of the corresponding 
oligonucleotide requires that several nucleotide sequences be synthesised which are 
then associated by means of linkers, for example. 

In a further embodiment of the invention, the oligonucleotide is such that 
the concatenation coding for the polypeptide comprising the n (P-K) units is 
10 completed at its 5' end and/or at its 3' end by one or more codons coding, for 
example, for at least one lysine residue at the N-terminal extremity of the formed 
polypeptide. 

As an example, a preferred oligonucleotide of the invention is characterized 
in that it codes for a polypeptide with formula (P-K), formula K-(P-K) 4 , or with 
15 formula 2K(P-K) 4 . 

In a particular embodiment, the composition of this oligonucleotide 
corresponds to one of the sequences described in the following pages and identified 
by designations SEQ ID No: 1 and SEQ ID No: 2. 

The oligonucleotides described above constitute the basic resource for 
20 producing recombinant nucleotide sequences capable of expressing lysine-enriched 
plant protein reserves or polypeptide reserves. 

The invention thus provides a recombinant nucleotide sequence comprising a 
concatenation of nucleotides coding for a plant protein reserve, characterized in 
that it further comprises an oligonucleotide of the invention, inserted at one site of 
25 the nucleotide concatenation selected such that: 

• expression of the nucleotide sequence in a particular plant cell enables a 
modified protein reserve to be produced which is localised in that cell in a 
manner identical to or similar to the normal protein reserve which would be 
expressed in the same cell under the same conditions by the corresponding 
30 normal coding nucleotide concatenation; and/or 
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• the modified protein reserve coded by the nucleotide sequence is 
immunologically recognised by antibodies produced against the corresponding 
normal protein reserve. 

In particular, the antibodies cited above are constituted by a polyclonal 
5 serum or are obtained against epitopes of the normal protein reserve which are 
conserved in the modified protein reserve. 

The plant cells referred to above include any plant cell, regardless of its 
tissue origin or its nature. Reserve organ cells are of particular interest within the 
context of the invention, but also the cells of leaves, stems, tubers. . . . 
10 The expression "protein reserve" of a plant as used in the present 

application means a protein synthesised during seed maturation and which is used 
during the germination phase as the principal food reserve. 

In general, it concerns a polypeptide which can be synthesised in reserve 
tissue regardless of its location in the plant, in particular the protein reserves used 
15 in the present invention are those produced in the grain or seed of plants in the 
cereal, crucifer or legume group, and are, for example, prolamins or zeins. 

The choice of the site(s) for inserting the oligonucleotide in the 
concatenation coding for the plant protein reserve is determined by satisfying the 
conditions described above. Depending on the case, insertion may take place in a 
20 repeat (in terms of amino acid sequence) of the protein or at the C- or N-terminal 
extremity. 

The condition given above in which expression of the recombinant 
nucleotide sequence of the invention in a plant cell enables a modified protein 
reserve to be obtained, localised identically or similarly to the normal protein 
25 reserve which would be expressed under the same conditions in the same plant cell, 
comprising, for example for synthesised y-zeins, the possibility of being 
accumulated in the endoplasmic reticulum of plant cells expressing it, in particular 
in the protein bodies formed from the endoplasmic reticulum, when the protein is 
expressed in endosperm cells. 
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In order to obtain this result by means of the recombinant nucleotide 
sequences of the invention, expression systems adapted to the cell host in which 
the selected nucleotide sequence is expressed, and in particular the regulation 
elements, for example, promoters, are selected for their functional character in the 
tissue containing the transformed cells. Tests for making this selection can be 
carried out using the different constructs described in the examples. 

To verify that the immunological properties of the modified protein reserve 
expressed by the nucleotide sequence of the invention have not been modified 
consequently, antiserums such as aG2 antiserum, described more precisely in the 
experimental section below, have been used, for example. 

In a first embodiment of the invention, the recombinant nucleotide sequence 
is characterized in that it is obtained from a nucleotide coding concatenation which 
leads to expression of a protein reserve which is naturally depleted in lysine. 

In general, this recombinant nucleotide sequence codes for a modified 
protein reserve derived from a protein reserve which is naturally produced by a 
plant for use in animal or human foodstuffs. 

Thus protein reserves in which the lysine content has been modified within 
the context of the present invention are advantageously plant protein reserves from 
the cereal, legume or crucifer group. Particularly important protein reserves are 
those in maize, in particular zeins, and more particularly maize y-zein, for which 
the lysine content is intended to be increased. 

One particular recombinant nucleotide sequence of the invention is 
characterized in that the coding concatenation of nucleotides coding for the maize 
y-zein which it contains has the sequence shown in Figure 9. 

Other recombinant nucleotide sequences of the invention are characterized 
in that the coding concatenation of nucleotides they comprise codes for a protein 
reserve of a plant selected from the following; soya, sunflower, tobacco, wheat, 
oats, alfalfa, rice, oilseed rape, sorghum, and Arabidopsis . 

In a preferred embodiment of the invention, in the recombinant nucleotide 
sequence comprising a concatenation coding for maize y-zein, the oligonucleotide 
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of the invention is inserted in place of the concatenation coding for the Pro-X 
domain naturally present in the maize y-zein amino acid sequence or following this 
concatenation. The Pro-X domain of the maize y-zein amino acid sequence is 
constituted by the amino acids located between positions 70 and 91 of the amino 
5 acid sequence shown in Figure 9, corresponding to nucleotides 265 to 330 of the 
sequence shown in Figure 9. 

Preferably, in the nucleotide sequence of the invention, the oligonucleotide 
in place of or following the Pro-X domain is present between nucleotides 276 and 
357 of the sequence shown in Figure 9. 
10 In a further embodiment of the invention, in the recombinant nucleotide 

sequence comprising a concatenation coding for maize y-zein, the oligonucleotide 
of the invention is inserted following the Pro-X domain conserved in the maize y- 
zein sequence. 

In a further variation, in the recombinant nucleotide sequence comprising a 
15 concatenation coding for maize y-zein, the oligonucleotide of the invention is 
inserted into the Pro-X domain maintained in the y-zein sequence. 

The above insertions can be carried out using available techniques, for 
example recombination of sequences which have undergone one or more enzymatic 
digestion steps. 

20 In a particular embodiment of the invention, a selected protein reserve enriched 

in a particular amino acid is expressed in heterologous plant cells. In other words, a 
protein reserve which is naturally present in a given plant is expressed in an amino acid- 
enriched form in another plant or in a cell other than that in which it is naturally 
produced. 

25 In addition to the concatenation coding for a plant protein reserve and the 

oligonucleotide of the invention, the recombinant nucleotide sequences of the invention 
can also comprise an expression promoter, for example a promoter selected for its 
specific expression character in certain parts or tissues of the plants, or in contrast a 
promoter selected for its constitutive character. As an example, when they are specific, 

30 the promoters can be specific for seeds and/or organs or particular plant tissues. They 
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can alternatively, or also, be specific for one growth phase, for example a particular 
stage of germination. 

In contrast, the use of constitutive promoters means expression of the 
protein reserve is constant and general, causing competition between expression of 

5 the native protein reserve, when it is present, and the modified protein reserve. 

As an example, advantageous promoters for carrying out the invention are 
the maize y-zein promoter contained in the 1.7 kb sequence found upstream of the 
coding sequence shown in Figure 7, the cauliflower mosaic virus promoter, namely 
the promoter CaMV35S (European patent EP-B-0 131 623), the constitutive 

10 promoter for the actin 1 rice gene (PCT application PCT/US 9100073) or the 
specific "high molecular weight gluthenine" seed promoter for wheat (Colot V. et 
al., 1987, EMBO Journal, vol. 6, p 3559-3564). 

If necessary, these promoters are completed by other regulation sequences, 
in particular expression activators. 

15 Examples of other promoters which can be used in carrying out the 

invention are the promoter for the gene coding for the 2S protein reserve in 
Arabidopsis thaliana , or the lectin or B-phaseoline promoters for beans. 

Supplemental introduction of expression activators into the regulation 
sequences of the nucleotide sequences of the invention can also increase the level of 

20 primary transcription of the nucleotide sequence and, if appropriate, increase the 
quantity of modified protein reserves produced. The activators are, for example, 
introns of monocotyledons such as intron 1 of the rice actin gene. 

The invention also provides a cloning and/or expression vector, 
characterized in that it comprises, at a site which is not essential for replication, a 

25 nucleotide sequence satisfying one of the definitions given above. Examples of 
vectors of particular interest within the context of the present invention are the 
plasmids pP20yZ, pH30yZ or pH45yZ. Plasmid pP20yZ was deposited at CNCM 
[National Collection of Micro-organism Cultures] (Paris, France) on 31 st October 
1995, registration number 1-1640. Plasmid pH45yZ was deposited at the CNCM 

30 on 3 1 st October 1995, registration number 1-1639. 
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The scope of the invention also encompasses a polypeptide as expressed by 
a recombinant nucleotide sequence satisfying the above definitions. 

The expression "polypeptide" as used within the context of the invention 
does not introduce a particular limitation as regards the number of amino acids 
5 forming the polypeptide. It may include sequences comprising several amino acids, 
normally termed peptides, or much longer sequences such as those in proteins. 

In this regard, the invention provides lysine-rich modified maize y-zein, 
characterized in that it is coded by a recombinant nucleotide sequence as described 
above. 

10 In a preferred embodiment of the invention, the ly sine-enriched modified 

maize y-zein is characterized in that its amino acid sequence is modified by at least 
one polypeptide with formula (P-K) n:) where: 

• n is a whole number of 2 or more; 

• P represents a proline amino acid residue; 
15 • K represents a lysine amino acid residue; 

• the symbol represents a bond between the two amino acid residues, in 
particular a peptide type bond, the n (P-K) units being bonded together by such 
bonds, in particular peptide type bonds. 

In a variation, the polypeptide integrated in the y-zein amino acid sequence 
20 has the formula K-(P-K) n . 

The polypeptides of the invention with one of formulae (P-K) n , K-(P-K) n or 
with one of the variations are substituted for a sequence naturally present in the 
normal maize y-zein or inserted with deletion of one or more amino acids of the 
amino acid sequence of normal maize y-zein, or added to the normal y-zein amino 
25 acid sequence, the insertion site for the polypeptide being selected such that: 

• when the modified lysine-rich y-zein is produced in a host cell, in particular in a 
plant cell, it is localised in that cell in identical or similar manner to normal 
maize y-zein which would be produced under the same conditions, in the same 
host cell; and/or 
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• the modified maize y-zein is recognised by antibodies directed against the 
normal maize y-zein. 

The P20yZ proteins shown in Figure 11 or H30yZ or H45yZ shown in 
Figure 10 are preferred embodiments of the invention and represent lysine-enriched 
5 modified maize y-zeins. 

The invention also provides a recombinant host cell, characterized in that it 
comprises a nucleotide sequence as described above. 

Examples of host cells of interest are bacterial cells, such as E. Coli or 
Agrobacterium tumefaciens . Preferably, within the context of the invention and for 
10 stable expression of the desired modified protein reserve, host cells of plant origin 
will be used. 

As an example, the cells of plant origin are seed, plant and, for example, as 
is preferred, maize seed endosperm cells. 

The nucleotide sequence of the invention is preferably introduced into the 
15 genome of the host cell in a stable manner and under conditions such that the 
expressed protein reserve which is enriched in amino acids, in particular lysine, is 
localised as the corresponding normal protein would be in the same host cell. 

A variety of techniques are available for transforming host cells. Examples 
for transforming host cells in a stable or transient manner, electroporation, 
20 bombarding with microprojectiles carrying DNA using a particle cannon, explant 
culture with Agrobacterium tumefaciens , by microfibre penetration. 

In addition to maize seed endosperm cells, soya, sunflower, tobacco, 
wheat, oats, alfalfa, rice, oilseed rape, sorghum or Arabidopsis cells can be used to 
express the nucleotide sequences of the invention. 
25 The present application also relates to seeds producing a polypeptide as 

described above and the plants producing this polypeptide. These plants are 
preferably maize. 

The invention also relates to seeds obtained from transformed plants 
expressing the polypeptide of the invention, in other words the modified protein 
30 reserve enriched with particular amino acids. 
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In a particularly interesting embodiment of the invention, the modified 
lysine-enriched y-zein proteins are expressed in opaque-2 maize mutants. The 
lysine content of these o2 mutants described by Emerson R. A. et al., (1935, 
Cornell Univ. Agric. Exp. Stn. Mem. 180) and characterized by Mertz E. T. et al, 
5 (1964, Science 145: 279-280) is substantially increased thus greatly increasing the 
nutritional qualities of the maize (compensating for the its low level of this essential 
amino acid). Conventional maizes have a lysine content of about 0.24% of the raw 
product (total grain weight), but opaque-2 maizes have close to 0.5% of lysine. 
However, they have insufficient agronomic characteristics as their endosperm is far 

10 less vitreous and is very friable ("starchy" phenotype). This renders them 
extremely sensitive to pathogenic organisms and to post harvest treatments. This 
phenotype is due to a large reduction in certain protein reserves, in particular alpha 
zeins. In fact, opaque-2 codes for a transcription factor necessary for expression 
of certain zein genes (Schmidt R. J. et al, 1990, Proc. Natl Acad. Sci. USA, 87, 

15 46-50). 

Opaque-2 derivatives no longer having the disadvantages cited above have 
been developed by conventional genetic improvement, namely QPM (Quality 
Protein Maize). Recent genetic analysis of these maizes (Lopes M. A., et al., 
1995, Theor. Appl. Genet. 19, 274-281) has shown that only 2 or 3 loci are key 

20 loci in these favourable modifications. More detailed genetic and biochemical 
analyses have resulted in the postulation that one of the 3 loci responsible is the y- 
zein locus: maize genotypes which carry a duplicate of this gene located in the 
centrometric region of chromosome 7 have all been shown to be opaque-2 
modifiers (Lopes M. A. et al., 1995, Mol. Gen. Genet. 19: 247: 603-613). 

25 The present invention also enables opaque-2 mutant maizes to be prepared 

from maize having only one y-zein gene in chromosome 7, which are 
complemented by addition of a recombinant sequence coding for a lysine-enriched 
maize y-zein. In addition to acquiring hardness properties similar to a non mutant 
opaque-2 maize, it has the advantage of significantly increasing the lysine content, 

30 thus exceeding that of QPM maize. 
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The present invention enables modified maize opaque-2 mutants to be 
obtained, into which a recombinant nucleotide sequence coding for a lysine-enriched 
maize y-zein has been inserted. 

The invention also provides a method of obtaining plants or seeds 
5 expressing a modified protein reserve, characterized in that it comprises the steps 
of: 

a) transforming a plant cell, with a nucleotide sequence or a vector as described 
above, under conditions enabling the modified protein reserve coded by the 
nucleotide sequence to be expressed in a stable and functional manner; 
10 b) regenerating plants from the plant cell transformed in step a), to obtain plants 
expressing the modified protein reserve; 
c) if necessary, obtaining seeds from the modified plants obtained in step b). 

In an advantageous implementation of the invention, the transformed plant 
is maize and the enriched modified protein reserve is lysine-enriched y-zein. 
15 The invention also relates to plants obtained by carrying out such a method. 

In order to evaluate the content of a given amino acid in plants of the 
invention, it is possible to use an assay protocol such as that described in Zarkadas 
et al, 1995, J. Agri. Food Chem. Vol. 43: pages 84-93. 

Further characteristics and advantages of the invention will become 
20 apparent from the following examples and the accompanying figures. 
Figure 1 

• Restriction map of plasmid pP20yZ; 
Figure 2 

• Restriction map of plasmid pH45yZ; 
25 Figure 3 

• Schematic representation of proteins coded by modified and non modified y-zein 
genes: wild type y-zein (yZ), and lysine-rich y-zeins (P20yZ, H30yZ, H45yZ and 
N13yZ) resulting from inserting oligonucleotides coding for lysine-rich 
sequences. The amino acid sequence of the inserted polypeptides is indicated 
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using the single-letter amino acid designations. The following abbreviations are 
used; 

Term: terminal; 

ProX DOMAIN: proline-Xaa linker domain. 
5 Figure 4 - In-vitro analysis of lysine-rich y-zeins- (A) in vitro translation and 
translocation of transcripts corresponding to lysine-rich modified y-zeins; lines 1, 5, 
9 and 13: complete translation products; lines 2, 6 ? 10 and 14: complete translation 
products after translocation in canine microsomes (CM); lines 3, 7, 11 and 15: 
translocation products resistant to the action of proteinase K (PK); lines 4, 8, 12 

10 and 16: totality of translation products after treatment with proteinase K in the 
presence of 0.5% Nonidet P40 (NP40). (B) Immunoprecipitation of in vitro 
translation products corresponding to y-zein proteins and lysine-rich modified y- 
zein, using aPL antiserum. Line 1: y-zein; line 2: P20yZ; line 3: H30yZ; line 4: 
H45yZ and line 5: N13yZ. (C) Same legend as for (B) but using aG2 antiserum. 

15 The molecular weight markers (in kilodaltons) are shown on the left. 

Figure 5 - Tissue-specific activity of the y-zein promoter. Maize endosperms, 
embryos and leaves were transformed by bombarding with particles using the 
constructs represented in the figure (in the right hand portion). The relative 
activities of luciferase (LUC, grey columns) and B-glucuronidase (GUS, hatched 

20 columns) are expressed in the form of a multiplier of the values obtained with 
naked projectiles ± the standard deviation of the different ratios. 
Figure 6 - Expression of lysine-rich y-zeins in the sub-aleuronic endosperm of 
cells. (A) Immunoblot with aPL antiserum, of proteins extracted from endosperms 
transformed by pN13yZ (line 2), pH45yZ (line 3), pH30yZ (line 4) and pP20yZ 

25 (line 5). The control (line 1) corresponds to non transformed endosperms. The 
molecular weight markers (in kilodaltons) are shown on the left hand side. (B) 
Expression of transcripts H45yZ and N13yZ in transiently transformed 
endosperms. The cDNAs obtained from tissues transformed with pH45yZ (line 2), 
pN13yZ (line 3) and the control (line 1) were amplified by PCR and analysed using 

30 a synthetic oligonucleotide coding for a lysine-rich sequence used as a probe. 
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Figure 7 - Accumulation of lysine-rich y-zeins in the protein bodies of the 
endosperm. (A) Immunoblot analysis, using aPL antiserum, of protein bodies 
isolated from endosperms transformed with pP20yZ (line 1), pH30yZ (line 2), 
pH45yZ (line 3), pN13yZ (line 4) and no DNA (line 5). (B) Immunoblot analysis, 

5 using aPL antiserum, of protein bodies isolated from endosperms transformed with 
pP20yZ, pH30yZ and pH45yZ and digested with proteinase K in the presence of an 
isotonic buffer (Sugar., lines 1, 3 and 5) or a hypotonic buffer (H 2 0, lines 2, 4 and 
6). The molecular weight markers (in kilodaltons) are shown on the left hand side. 
Figure 8 - Co-localisation of P20yZ proteins with a- and y-zeins in protein bodies 

10 of maize endosperm. An immunocytochemical analysis was carried out on 
ultrafine sections using aPL antibodies (labelled with 15nm diameter gold 
particles) and aZ and aG2 antibodies (labelled with 5 nm particles). (A) protein 
bodies of endosperm transformed with pP20yZ, immunolabelled with aPL 
antibody. (B) Immunolocalisation of P20yZ (labelled with 15 nm gold particles) 

15 and y-zein (labelled with 5 nm gold particles) in protein bodies isolated from 
endosperms transformed with pP20yZ. (C) and (D) Immunolocalisation of P20yZ 
(labelled with 15 nm gold particles) and y-zeins (labelled with 5 nm gold particles) 
in protein bodies isolated from endosperms transformed with pP20yZ. The arrows 
indicate tangential sections of the protein bodies. 

20 Figure 9- Coding sequence of maize y-zein cDNA and the corresponding amino 
acid sequence. 

Figure 10 - Coding sequence of cDNA of the H45yZ maize zein and the 
corresponding amino acid sequence. 

The lysine-rich sequence (28 amino acids) was introduced between amino 
25 acid residues 92 and 1 19 of the sequence shown in Figure 10. 

Figure 11 - Coding sequence of cDNA of the P20yZ maize zein and the 
corresponding amino acid sequence. 

The lysine-rich sequence (14 amino acids) was introduced between amino 
acid residues 92 and 1 19 of the sequence shown in Figure 11. 
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Figure 12 - Restriction maps for plasmids Pbin 19P20/Z and pBinl9H30yZ. 
Figure 13 - Transgenic maize plant endosperms accumulating lysine-enriched y- 



A and B: SDS-page and immunoblot using aPL antiserum. 

5 A) 10 jig of protein per track (transformants with construct H45yZ) 

track C: protein extract from endosperms of hybrids B73xA188 (control) 
track 1: Al 
track 2: Bl 
track 3: B2 
10 track 4: CI 
track 5: Dl 
track 6: D2 

B) 1 |ig of protein per track (transformants with construct P20yZ) 

15 track C: control 

track 1; Al 

track 2: A2 

track 3: Bl 

track 4: CI 
20 track 5: Dl 

track 6: El 

C) SDS-PAGE and stain with silver (3 P20yZ transformants and 3 H45yZ 
transformants) 

25 track 1: A2 1 



zem. 
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track 2: Bl 
track 3: CI 
track 4: Bl 
track 5: Al 
track 6: D2 



10 jig of protein per track 



track 7: Bl 
track 8: Al 
track 9: D2 



J 



40 |ng of protein per track 
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Figure 14 - Lysine-enriched y-zein content per grain (transformants 45yZ Bl and 
CI) 
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A: silver stain; 10 jug of protein per track; 

B immunoblot using aPL antiserum; 1 \ig of protein per track; 

Tracks 1 to 5; protein extracts from different endosperms of transformant 45yZ 

Bl; 

5 Tracks 6 to 10: protein extracts of different endosperms of transformant 45yZ CI. 
Figure 15: 

A) Lysine-enriched y-zein content of 10 grains (transformant 45yZ CI) using aPL 
antiserum, 1 jig of protein per track; 
Tracks 1 to 10: endosperm extract from 10 descendants; 
10 B) Immunoblot of protein extracts of endosperms 1 to 5 present in A) and labelled 
with aG2 antiserum; 2 jig of protein per track. 

EXAMPLES 

A) Preparation of lysine-enriched modified y-zeins and expression of these 
15 modified proteins followed by accumulation in protein bodies of maize 

endosperm cells 

y-zein is a tulphur-rich maize protein reserve, with a molecular weight of 28 
kD which is accumulated in endosperm cells with a- and B-zeins, in protein bodies 
derived from the endoplasmic reticulum (ER) of the grain (Ludevid et al, 1984, 

20 Plant Mol. Biol. 3, 227-234; Lending et al, 1984, Plant Cell 1, 1011-1023). The 
amino acid sequence deduced from the cDNA nucleotide sequence (Prat et al., 
1985, Nucl. Acids Res. 13, 1493-1504) and genomic clones (Boronat et al., 1986, 
Plant Sci. 47, 95-102) show that the y-zein has no homology with a-zein type 
polypeptides. While the y-zein is coded by 1 or 2 genes per haploid genome 

25 (Boronat et al., 1986, Plant Sci., 47, 95-102), it represents 10-15% of the totality 
of the maize endosperm proteins. Expression of the y-zein gene in heterologous 
systems such as Xenopus oocytes (Torrent et al., 1994, Planta 192, 512-518) and 
in Arabidopsis thaliana (Geli et al, 1994, Plant Cell 6, 1911-1922), indicates that 
y-zein polypeptides remain stable and are capable as they are of forming protein 

30 bodies derived from the endoplasmic reticulum inside the cells. Further, analyses 
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involving deletion of different structural domains from the y-zein have shown that 
the N-terminal sequence including the proline-rich repeat is responsible for 
retaining the y-zein in the endoplasmic reticulum and the cysteine-rich C-terminal 
domain is responsible for forming the protein bodies. The Pro-X domain does not 
5 appear to affect the stability of the protein nor its targeted localisation (Geli et al, 
1994, Plant Cell 6, 1911-1922). 

Material and methods 
Plant material 

After surface sterilisation (1), grains at stage 17 DAP (days after 
10 pollination) of W64A maize were dissected by hand and the pericarpal layer and 
the aleurone were separated from the endosperms. Tangential sections were made 
to expose a large part of the sub-aleuronic surface. If necessary, embryos were 
isolated and leaves from 7 day old plants were dissected to extract the epidermal 
tissue. After dissection, the samples were placed in petri dishes on filter paper 
15 moistened with MS medium (Murashige and Skoog, 1962, Physiol. Plant 15, 473- 
497). 

Plasmid constructs 

A first group of plasmids, pKSG2, pHpP2, pPbP4 and pNaNl, was 
obtained to enable restriction sites to be introduced into the gene coding for y-zein. 
20 pKSG2 and pHbP2 were constructed in accordance with the description in the 
publication by Torrent et al. (Planta (1994) 192: 512-518). Plasmid pKSG2 
contained the sequence coding for y-zein. 

Plasmid pHbP2 was obtained from pKSG2 and contained a sequence 
coding for a mutated y-zein from which the Pro-X domain of the protein had been 
25 deleted. 

Plasmid pPbP4 was obtained following two cloning steps: (I) the restriction 
fragment Sall-Pvull of 350 kb from pKSG2 was cloned in a Bluescript plasmid 
(pBSKS, Stratagen, La Jolla, California, USA) restricted with Sail and EcoRV 
(pKSC4) and (ii) the restriction fragment Pvull-Xbal of 600 bp from pKSG2 was 
30 cloned in restriction sites Smal-Xbal of pKSC4. The new construct pPbP4 
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contained a useful EcoRI restriction site just before the P-X domain of the y-zein 
coding sequence. 

Plasmid pNaNl was also obtained following two cloning steps, (i) the 
Nael-Xbal fragment of 250 bp from pKSG2 was cloned in the plasmid pBSKS 

5 restricted with EcoRV-Xbal (pKSC8) and (ii) the restriction fragment Nael-HindlH 
of 700 bp (open ends) from pKSG2 was cloned in the Hindlll restriction site of 
pKSC8. The new construct, pNaNl, contained restriction sites Clal and Hindlll at 
a position located 15 nucleotides in front of the stop codon for the y-zein. 

Two synthetic nucleotides with the following sequences: SEQ ID N° 1 : 

10 5'CGATGAATTCAAACC AAAGCC AAAGCCGAAGCC AAAAGAATTC A3 \ 
and the inverse sequence termed SEQ ID N° 2, with the following sequence: 
5 ' AGCTTGAATTCTTTTGGCTTCGGCTTTGGCTTTGGTTTGAATTC AT3 ' 
coding for lysine-rich sequences termed (P-K) 4 , were hybridised, digested with 
EcoRI and cloned in a EcoRI site of pHbP2 and pPbP4. Three clones were 

15 selected: pPo2 and pHo3 containing the sequence coding for K(P-K) 4 and pHo4 
comprising the truncated form of the sequence coding for y-zein containing a 
tandem 2K(P-K) 4 (in the form of a sequence K(P-K) 4 EF K(P-K) 4 ) of the lysine- 
rich coding sequence. The same hybridised oligonucleotides were digested with 
Clal-Hindlll enzymes and cloned in plasmid pNaNl restricted using the same 

20 enzymes. The selected clone, pNol, contained the sequence coding for the lysine- 
rich sequence K(P-K) 4 at the N-terminal extremity of the corresponding modified 
y-zein. 

For transient transformation of the endosperm, sequences coding for the 
modified y-zein of pPo2 and pHo3 were inserted in the form of HincII-Nhel 

25 fragments in Smal-Xbal sites of pDH51 (Pietrzah et al, 1986, NucL Acid Res., 14, 
5857-5868) containing the 35S promoter of the cauliflower mosaic virus (CaMV). 
The promoter pP20yZ obtained by the insertion described above of HincII-Nhel 
fragments in plasmid pDH51 contained the coding sequence for ly sine-enriched y- 
zein (Figure 8) and signals of the 35S sequence of the CaMV virus for forming the 

30 3' end and polyadenilation. The chimeral coding sequence P20yZ was constructed 
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from the region coding for y-zein contained in the pKSG2 plasmid after different 
cloning steps. The 1.7 kb promoter for the y-zein (Reina et al., 1990, NucL Acids 
Res. 18, 6426) was inserted in the blunt ends of a Hindll-Pvul fragment in pH04 
and pNol restricted with XhOl and obtained with blunt ends. Constructs pH45yZ 

5 and pN13yZ were obtained respectively. 

The novel constructs, respectively termed pP20yZ, pH30yZ, pH45yZ and 
pN13yZ, were used in biolistic bombardment experiments. 

To study the specificity of different promoters as regards plant tissues, two 
constructs, pl.7yZGUS and pCaMV35SLUC were used. pl.7yZGUS was 

10 obtained by inserting the 1.7 kb y-zein promoter (Hindlll-Pvul) into a plasmid 
derived from pPuC18 containing the GUS gene and NOS signals for 
polyadenilation of pBI 101.1 at 3' (Jefferson et al., 1987, Embo. J. 6, 3901-3907). 
pCaMV35SLUC was obtained by inserting the gene coding for luciferase (LUC) 
from pAHC18 (Bruce et al., 1989, P. H. 86, 9692-9696) into the pDH51 

15 polylinker (Pietrzak et a., 1986, Nucl. Acids Res. 18, 6426). 

In-vitro analysis 

The plasmids derived from pBSKS containing the coding sequences for the 
y-zein (pKSG2) and lysine-rich y-zein (pPo2, pHo3, pHo4 and pNol) were 
transcribed in vitro using standard protocols (Sambrook et al, Molecular Cloning: 

20 A laboratory manual, 2 nd edition, Cold Spring Harbor Laboratory ed., Cold Spring 
Harbor, New York). In vitro translation and translocation of the synthetic 
transcripts was carried out using the Torrent et al technique (1994, Planta 192, 
512-518), with the exception of the canine microsomes (CM) which originated 
from Promega (Madison, Wis., USA). The translated products were 

25 immunoprecipitated essentially using the Borgese and Gaetani method (1980) 
using an anti-y-zeina-G2 rabbit serum (Ludevid et al, 1985, Plant Sci., 41, 41-48) 
and an aPL antiserum. aPL is a polyclonal rabbit antiserum obtained against the 
synthetic peptide EFK(P-K) 8 EF. This peptide was synthesised using the solid 
phase synthesis technique described by Celma et al., 1992. 

30 Microprojectile bombardment 
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Plasmidic DNA was absorbed onto gold particles (1.0 jim, Bio-Rad, Lab., 
Richmond, CA, USA) using a protocol described by Kikkert (Plant Cell, 33: 221- 
226, 1993). All of the targets were bombarded twice, using a BioRad Biolistic 
PDS/100/He apparatus. The targets were positioned 8 cm behind a screen 
stopping macrocarriers, which were positioned 1 cm below a 900 PSI rupture disk. 
After bombardment, the samples were incubated for 24 hours at 26°C in the dark. 
The controls were constituted by targets bombarded with microprojectiles 
containing no DNA. 

Enzymatic tests 

Tissues bombarded with pUyZGUS and pCaMV35SLUC plasmids were 
homogenised over ice in a buffer containing 25 mM of Tris, at a pH of 7.8, 2 mM 
of DTT, 10% of glycerol and 1% of Triton X-100. After centrifiigation at 12000 g 
for 5 minutes, the supernatants were decanted and the total soluble protein in the 
extracts was quantified using the Bradford test (Bio-Rad). The GUS activity was 
tested by fluorimetric analysis following the description by Jefferson (1987) using 
4-methyl-ombelliferyl-B-D-glucuronide (MUG) as a substrate. The LUC activity 
was determined using a Luciferase Assay System Kit sold by Promega, following 
the manufacturer's instructions. 

Extraction of protein reserves and gel analysis of proteins 

Endosperms transformed with pP20yZ, pffiOyZ, pH45yZ and pN13yZ 
were reduced to flour and the a-zeins were extracted by means of three series of 
solvents containing 70% of ethanol. The residual flour was air dried, and the total 
proteins were extracted with a buffer containing 0.25 M of Tris-HCl, pH 6.8, 4% 
of sodium dodecyl sulphate (SDS) and 5% of 2-mercaptoethanol, for 1 hour at 
ambient temperature. The protein extracts were analysed by SDS-PAGE and 
immunoblot following the description by Ludevid et al., 1985. Nitrocellulose 
leaves were incubated with aPL antiserum (dilution 1:500) and Raifort peroxidase 
conjugated with a secondary antibody (ECL Western Blotting System, Amersham, 
Buckinghamshire, UK) was used to detect the protein. 

Analysis of RNA expression 
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Total RNA was extracted in accordance with the description by Logemas 
et al, 1987. Complementary DNA (cDNA) was prepared using reverse 
transcriptase and oligo dT from Gibco BRL (Gaithersburg, MD, USA) following 
the manufacturer's instructions, and this RNA was amplified using a PCR reaction. 
5 Primer oligonucleotides used for the PCR were 20-mer sequences corresponding 
to the 5' and 3 5 ends of the y-zein structure. Standard protocols were used to 
prepare the 32 P labelled probes, and for gel analysis of the DNA (Sambrook et al., 
Molecular Cloning: A Laboratory Manual, 2 nd ed., Cold Spring Harbor Laboratory 
Ed., Cold Spring Harbor, New York) using a synthetic oligonucleotide coding for 
10 a lysine-rich sequence (see above) as a probe. 

Isolation of protein bodies and treatment with protease 
These protocols were described above (Torrent et al, Planta, 180: 90-95, 

1989). 

Electron microscopy 

15 The protein bodies of wild type endosperms and endosperms transformed 

by pP20yZ were fixed with 2.5% of paraformaldehyde in 20 mM of phosphate 
buffer at pH 7.2, for 1 hour at ambient temperature, and transformed in accordance 
with the description by Geli et al, 1994, Plant Cell 6, 191 1-1922), using, however, 
a aPL antiserum and a colloid of gold and protein A with a diameter of 15 nm. 

20 For double labelling, ultra fine sections were first incubated with aPL and the 
colloid of gold and protein A (15 nm diameter) was used to detect the antibody. 
After washing, sections were incubated with 0.15 mg/ml of protein A for 20 
minutes to saturate the immunoglobulins and finally the screens were incubated 
with a-G2 or a-Zl sera and the gold/protein A colloid (5 nm diameter) was used 

25 to detect the antibody. a-Zl is a polyclonal rabbit antiserum directed against the 
a-zein obtained following the description by Ludevid et al., 1985, Plant Sci., 41, 
41-48. 

30 
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Results 

Construct of lysine-rich y-zeins 

The inventors have demonstrated the importance of the proline-rich repeat 
and the cysteine-rich C-terminal domain for retention of the y-zein in the 

5 endoplasmic reticulum and the formation of protein bodies containing these 
proteins in the cells of Arabidopsis leaves (Geli et al, 1994, Plant Cell 6, 1911- 
1922). On the basis of these previous results, the possibility of inserting lysine-rich 
sequences in different domains of the y-zein, to create a modified y-zein correctly 
targeted and accumulated in the endosperm cells, was investigated to improve the 

10 nutritional qualities of the maize. 

The inventors have now constructed modified y-zein genes by introducing 
synthetic oligonucleotides coding for lysine-rich sequences into different sites of 
the y-zein coding sequence. Modified y-zein constructs were created so as to 
avoid placing lysine-rich coding sequences in domains constituted by the tandem 

15 repeat and the cysteine-rich domain. Modifications of the y-zein coding sequence 
were made in the sequence corresponding to the Pro-X domain. Further, to 
minimise any alteration to protein folding, the lysine-rich sequences (P-K) n were 
defined to imitate the sequence of the Pro-X domain. As can be seen in Figure 3, a 
sequence K(P-K) 4 has been introduced into the protein P20yZ after the Pro-X 

20 region and in protein H30yZ and in protein H45yZ, amino acid sequences including 
K(P-K) 4 and 2K(P-K) 4 respectively replace the Pro-X domain of the y-zein (yZ, 
fig, 3). To study whether the C-terminal extremity was a neutral site for the 
introduction of lysine-rich sequences, a supplemental N13yZ protein was created 
by inserting a sequence containing K(P-K) 4 five amino acids upstream of the C- 

25 terminal extremity (Figure 3). 

Activity of the y-zein promoter in transformed maize endosperm 
To determine whether lysine-rich y-zeins could be expressed in endosperm 
cells, an efficient promoter and a transformation system were researched in the first 
instance. A y-zein promoter containing an upstream 1.7 kb sequence (Reina et al, 

30 1990, Nucleic Acid Research, vol. 18, p 6426) and the CaMV promoter containing 
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625 bp of the sequence upstream of the 35S protein of the cauliflower mosaic virus 
CaMV were tested. Until now, no information has been available on the functional 
analysis of gene fusions with the y-zein promoter in transgenic monocotyledonous 
plants. To analyse the activity and tissue specificity of the y-zein promoter, two 
chimeral genes were constructed (see Figure 5). Transient expression by biolistic 
bombardment (Klein et al, 1988 PNAS 85: 4305) was used as the maize 
transformation procedure to analyse the promoter and for lysine-rich y-zein 
expression experiments. Maize endosperms at the 17 DAP (days after pollination) 
stage (the pericarp and the cells of the aleuronic layer were removed), embryos (17 
DAP) and young leaves (10 days old) were bombarded with gold projectiles coated 
with plasmidic DNA containing the two constructs. Figure 5 shows the 6- 
glucuronidase (GUS) activities and luciferase (LUC) activities present in the three 
tissues tested: endosperm, embryo and leaf, with respect to the control experiment. 
It should be noted that the results correspond to the average of at least 3 
independent experiments carried out. All GUC activity under the control of the y- 
zein promoter was restricted to the endosperm, since no GUS expression was 
detected in the embryo and in the leaves. Further, the bombarded endosperms 
were histochemically stained to determine the number of cell clusters expressing 
the GUS protein. The stain profile corroborated the above results, GUS being 
strongly expressed in the endosperms (the average number of GUS stained clusters 
per endosperm was 150) and blue spots were not detected in the embryo and in the 
leaves. In contrast, the promoter CaMV35S conferred a LUC activity on all of the 
tissues tested (Figure 5), but there were quantitative differences between the 
relative activity of the enzyme in the leaves and the embryos with respect to the 
endosperm. These differences could be attributed to an intrinsic variability in the 
constitutive activity of the CaMV promoter between the different maize tissues or 
to low penetration of DNA-coated particles into the mesophyll cells containing a 
large vesicular system. The prior art contains tests in which the CaMV promoter 
normally has low activity in monocotyledonous plant cells (Fromm et aL, 1985, 
Proc. Natl. Acad. Sci. USA 82, 5824-5828); however, the inventors have 
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demonstrated a high activity of the CaMV promoter in endosperm cells. This 
prompts the conclusion that the activity of the y-zein promoter and the CaMV35S 
promoter was very high in maize endosperms and thus the two promoters could be 
useful in controlling expression of the protein in this tissue. 

In order to determine if the mutant proteins coded by the constructs were 
competent with the membrane translocation function, in-vitro transcription- 
translation experiments were carried out in the presence of dog pancreatic 
microsomes. The synthetic transcripts of each construct were translated and 
translocation through the microsome membranes was tested by examining the 
protection as regards the digestion with K proteinase. These results, shown in 
Figure 4A, indicate that the apparent molecular weights of in-vitro synthesised 
polypeptides reflect the mutations introduced (Figure 4 A, lines 1, 5, 9 and 13). In 
the presence of microsomes and the K proteinase, low molecular weight peptides 
were not observed, indicating that the complete polypeptide chains of the modified 
y-zeins were transported through the microsomal membranes (Figure 4 A, lines 3, 
7, 11 and 15). By comparing the result of translocation of the four modified y- 
zeins, it can be seen that protein H45yZ (Figure 4 A, line 11), which contained the 
10 lysine type amino acid insertion, had undergone less translocation than the other 
proteins. It appears that negatively charged residues could interfere to a certain 
extent with translocation efficiency. Since the polyclonal antibody aG2, directed 
against y-zein (Ludevid et al, 1985, Plant Sci. 41, 41-48) could not be used to 
distinguish between the wild type y-zein and modified y-zeins, an aPL antibody 
directed against a synthetic peptide containing the lysine-rich amino acid sequence 
was prepared. It was then tested to examine whether the modified proteins 
synthesised in vitro were recognised both by the ocG2 antibodies and the aPL 
antibodies. This experiment is illustrated in Figure 4B and Figure 4C where the 
synthetic transcripts of the y-zein, P20yZ, H30yZ, H45yZ and N13yZ, were 
translated in-vitro and in which the translation products were immunoprecipitated 
with aPL (Figure 4B) and with aG2 (figure 4C). These results indicate that 
lysine-rich y-zeins were recognised by the two antibodies (see Figure 4B and C, 
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lines 2 to 5) and that the y-zein was only recognised by the otG2 serum (Figure 4B 
and C, line 1). Thus the specificity of the antibodies aPL for the modified proteins 
enabled the lysine-rich y-zein to be distinguished from the endogenic y-zein when 
the modified genes were expressed in endosperm cells. Taken together, these 

5 experiments have shown that the presence of lysine-rich sequences did not disturb 
the function of translocation through the membrane or the immunological 
behaviour of the y-zein. 

Analysis of expression of lysine-enriched y-zeins in maize endosperms 
To explore whether lysine-rich modified y-zein was expressed and 

10 accumulated in endosperm cells, stage 17 DAP grains were bombarded with DNA 
containing sequences coding for the protein of the four constructs (Figure 3) under 
the control of the CaMV promoter (P20yZ and H20yZ) and the y-zein promoter 
(H45yZ and N13yZ). Constructs including antisense promoters or free of 
promoters were used as controls. After 24 hours of endosperm transformation, the 

15 total proteins were extracted and the expression of modified y-zein was tested by 
immunoblotting using the aPL antibody. Figure 6A shows that the chimeral genes 
of y-zein containing the (Pro-Lys) n insertion after the Pro-X domain (P20yZ) or 
replacing it (H45yZ and H30yZ) were strongly expressed and the translation 
products accumulated efficiently in the endosperm cells (Figure 6 A, lines 3, 4 and 

20 5). For each line, the protein extracts corresponded' to about 1/3 of one 
bombarded endosperm, enabling it to be estimated that the quantity of modified 
proteins P20yZ, H30yZ and H45yZ per endosperm reached a nanogram level. 
Further, no quantitative difference between the level of expression of chimeral 
genes under the control of the CaMV promoter and the y-zein promoter was 

25 observed, confirming the results described above obtained with reporter proteins 
GUS and LUC (Figure 5). It should be noted that the aPL antibody recognised a 
protein of about 30 kD present in the total protein extracts, even in non 
transformed endosperms (see the weak band present in the four lines of Figure 
6A). A sequential protein extraction procedure established that this protein was 

30 not a protein reserve which was soluble in an aqueous medium. 
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Figure 6 A (line 2) shows that no trace of the N13yZ protein could be 
detected, indicating that the corresponding chimeral gene was not expressed in the 
endosperm cells or that the N13yZ protein had degraded. The RNAs of 
endosperms transformed with the DNAs coding for the H45yZ and N13yZ proteins 
and the RNAs of non transformed endosperms were analysed. From the total 
RNAs, the cDNAs were prepared and amplified by PCR using specific primers. 
Figure 6B shows the Southern blot analysis of three cDNA samples hybridised with 
an oligonucleotide coding for a sequence K(Pro-Lys) 4 used as a probe. The results 
indicated that the N13yZ gene was correctly expressed (Figure 6B, line 3). The 
presence of bands in the H45yZ and N13yZ samples but not in the non transformed 
endosperms, has suggested that the N13yZ protein was degraded during the 24 
hours of incubation. From these observations, the inventors have concluded that 
the insertion site for lysine-rich sequences was critical for the stability of the 
modified y-zein. 

Lysine-enriched y-zein is accumulated in protein bodies 
Apart from the lysine content in the Pro-X sequences, the P20yZ, H30yZ 
and H45yZ proteins had common characteristics with the wild type y-zein: they had 
the signal peptide, the N-terminal tandem repeat and the cysteine-rich C-terminal 
region in common. It appeared important to determine whether these domains 
remained completely functional preserving targeting and formation of the protein 
bodies or whether the lysine-rich sequences created a special environment in which 
these properties could be perturbed. To test this, an investigation was made as to 
whether the modified y-zeins were capable of accumulating in protein bodies. A 
sub-cellular fractionation was carried out with transformed endosperms. 
Homogenates of bombarded endosperms were charged onto discontinuous sucrose 
gradients (20%, 50% and 70% of sucrose) and all of the fractions collected were 
analysed by immunoblotting. P20yZ, H30yZ and H45yZ sedimented on the protein 
body fraction and no significant quantity of these proteins was detected either in 
the supernatant or in the microsomal fraction (Figure 7 A, lines 2, 2 and 3). While 
the in-vitro experiments previously carried out (Figure 4A) had established that the 
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newly synthesised modified y-zeins underwent translocation in canine microsomes, 
the test in this case was whether the modified proteins expressed in-vivo in 
endosperm cells underwent translocation in the membrane of the endoplasmic 
reticulum and remained inside protein bodies derived from the endoplasmic 
5 reticulum. For this reason, isolated protein bodies were digested with K proteinase 
in isotonic buffers (containing 20% of sucrose) or after an osmotic shock in water 
(Figure 7B). Proteins protected against proteolytic degradation by enzymes can be 
surrounded by a membrane and treatment with detergent or hypotonic solutions 
results in digestion of the proteins (Walter and Blobel, 1983, Method Enzymol. 96, 
10 84-93). A comparison of the band intensities after digestion with K proteinase in 
media comprising sucrose or water revealed that the P20yZ, H30yZ and H45yZ 
proteins were protected from digestion in isotonic buffers (lines 1, 3 and 5) but 
were partially digested in water (lines 2, 4 and 6). 

Expression of modified genes of y-zein in the cells of the sub-aleuronic 
15 layer of the endosperm by biolistic bombardment resulted in the observation that 
lysine-rich y-zeins were accumulated to a great extent with the exception of the 
case where the lysine-rich sequences were positioned 5 residues upstream of the C- 
terminal extremity of the y-zein polypeptide. From this expression and 
immunocytochemical studies on isolated protein bodies, the inventors have 
20 demonstrated that lysine-rich y-zeins are properly accumulated in these organelles 
and are co-localised with the endogenous y-zein and a-zein proteins. 

Protein bodies isolated from P20yZ endosperms were examined by 
immunogold type labelling and electron microscopy. On ultra fine sections 
incubated with the aPL antibody (Figure 8A), the gold labelling was detected 
25 inside the protein bodies, indicating that the lysine-rich protein P20yZ was 
accumulated inside these organelles. In sections incubated only with aPL 
antibody, immunolabelling took place only on some protein bodies (containing 
lysine-rich y-zein), the large proportion of the isolated protein bodies were not 
immunolabelled with aPL antibodies as they corresponded to non transformed 
30 endosperm cells. To determine whether the lysine-rich y-zein was co-localised 
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with the oc-zeins and y-zeins, a double labelling using immuno-electron microscopy 
was carried out on isolated protein bodies using the aZ and aPL antibodies (Figure 
8B) and aG2 and aPL antibodies (Figure 8C and D). Figure 8B shows a 
micrograph of the transverse section of two protein bodies labelled with aPL 

5 antibody (15 nm gold particle) and with the aZ antibody (5 nm gold particle). The 
result of immunostaining showed that the P20yZ protein was accumulated in the 
protein bodies and co-localised with the a-zein (see the extent of labelling of the a- 
zein over the whole surface of the protein body). Further, tangential sections 
(Figure 8B, see arrows) and transverse sections (Figure 8D) of protein bodies were 

10 incubated with the aPL antibody (15 nm gold particle) and with the aG2 antibody 
(5 nm gold particle). In the two cases, the P20yZ protein was co-localised with the 
y-zein polypeptides. It was noted that the tangential sections of the protein body 
(Figure 8A, C see arrows), was easily distinguished from the transverse sections of 
the protein body in that the former had a higher electron density and the y-zein 

15 labelling extended over the whole section. In contrast, the transverse sections had 
a lower density and the y-zein labelling was localised on the membrane surrounding 
the protein body. In both cases, the labelling localisation of the lysine-rich y-zein 
followed that of the endogenous y-zein. 
B) Preparation of 2eneticallv modified plants expressing lvsine-ri ch y-zeins 

20 1) Production and use of maize callus as a target for genetic transformation 
Genetic transformation of maize, regardless of the method used 
(electroporation; Agrobacterium . microfibres, particle cannon) generally requires 
the use of undifferentiated cells in rapid division which have conserved an ability to 
regenerate whole plants. This type of cell constitutes the embryogenic friable 

25 callus (type II) of maize. 

These calli were obtained from immature embryos of the Hi II or 
(A188xB73) genotype using the method and media described by Armstrong 
(Maize Handbook; (1994) M. Freeling, V. Walbot Eds; pp 665-671). The calli 
obtained were multiplied and maintained by successive subculturing every fortnight 

30 onto the initiation medium. 
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Plantlets were then regenerated from these calli by modifying the hormonal 
and osmotic balance using the method described by Vain et al, (Plant cell Tissue 
and Organ Culture (1989 18: 143-151). These plants were then acclimatised under 
glass where they could be crossed or self-fertilised. 

5 2) Use of particle cannon for genetic transformation of maize 

The above paragraph described the production and regeneration of cell 
lines necessary for transformation; this section describes a genetic transformation 
method leading to stable integration of modified genes into the plant genome. This 
method is based on the use of a particle cannon identical to that described by J. 

10 Finer (Plant Cell Report (1992) 11: 323-328); the target cells were fragments of 
calli described in paragraph 1 . 4 hours before bombardment these fragments, with a 
surface area of 10 to 20 mm 2 , were disposed, in an amount of 16 fragments per 
dish in the centre of a petri dish containing a culture medium identical to the 
initiation medium, with an addition of 0.2 M of mannitol + 0.2 M of sorbitol. 

15 Plasmids carrying the genes to be introduced were purified on a Qiagen^ column in 
accordance with the manufacturer's instructions. They were then precipitated onto 
tungsten particles (M10) following the protocol described by Klein (Nature (1987) 
327: 70-73). The coated particles were projected against the target cells using a 
cannon and following the protocol described by J. Finer (Plant Cell Report (1992) 

20 11: 323-328). 

The bombarded dishes of calli were then sealed with Scellofrais® then 
cultivated at 27°C in the dark. The first subculture took place 24 h later then every 
fortnight for 3 months to a medium identical to the initiation medium with a 
selective agent added, the nature and concentration of which could be varied 

25 depending on the gene used (see paragraph 3). The selective agents which could 
be used generally consisted of active compounds of certain herbicides (Basta®, 
Round Up®) or certain antibiotics (Hygromycin, Kanamycin. . .). 

After three months or sometimes earlier, calli were obtained the growth of 
which was not inhibited by the selective agent, normally and mainly composed of 

30 cells resulting from division of a cell which had integrated one or more copies of 
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the selection gene into its genotype. The frequency of producing such calli was 
about 0.8 calli per bombarded dish. 

These calli were identified, individualised, amplified then cultivated so as to 
regenerate plantlets (see paragraph 1). In order to avoid any interference with non 
5 transformed cells, all of these operations were carried out on culture media 
containing the selective agent. 

The regenerated plants were acclimatised then cultivated under glass where 
they could by crossed or self-fertilised. 
3) Use of bar gene to produce genetically modified maize plants which have 
10 incorporated and which express the H45yZ gene 

The bar gene from Streptomyces hygroscopicus codes for a 
phosphinothricin acetyl transferase (PAT) which inactivates the active 
phosphinothricin molecule of the herbicide Basta® by acetylation. Cells with this 
gene are thus rendered resistant to this herbicide and can be selected by using it. 
15 For the cereal transformation, the coding sequence of the bar gene is under the 
control of a regulating region enabling strong and constitutive expression in plant 
cells. Such a region is advantageously constituted by the promoter and the first 
intron of the actin gene of rice as described by McElroy (Mol. Gen. Genet. (1991) 
231: 150-160). 

20 This chimeral gene is cloned on a plasmid enabling its amplification by 

Escherischi Coli. After amplification then purification on a Qiagen® column, this 
plasmid (pDM 302 Cao (Plant Cell Report (1992) 11: 586-591) can be used for 
genetic transformation of maize using, for example, the method described in the 
previous example. In this case, 2 mg/L of phosphinothricin was added to culture 

25 media intended to select transformed cells. 

To introduce the H45yZ gene, a co-transformation technique is 
advantageously used: the selection gene (bar) and gene of interest (H45yZ) were 
carried by independent plasmids. When using a particle cannon, the plasmids were 
co-precipitated onto tungsten particles, the total quantity of DNA precipitated on 

30 the particles remaining identical to that used in the standard protocol (5 |ug of 
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DNA per 2.5 mg of particles), each plasmid representing about half of the total 
weight of DNA used. 

The experiment shows that with this method, co-integration of the plasmids 
in the plant cells is the most frequent event (of the order of 90%) i.e., practically 
5 every plant which had integrated the bar gene and been selected by its use also 
carried the H45yZ gene. The level of co-expression (percentage of selected plants 
expressing the H45yZ gene) was normally of the order of 70%. 

The genes thus introduced were generally linked in the genetic sense, thus 
gene H45yZ could advantageously be followed through its descendants because of 
10 its resistance to the herbicide closely associated with it. 

The quantity of modified protein was determined using the methods 
described in Example A, in particular by immunoblotting onto protein extracts 
from immature or mature maize grains, removed as a pool from plants resistant to 
Basta 1 ". 

15 4) Example explaining the step of introducing transgenes, in particular the 
gene coding for H45yZ, to modify the maize opaque-2 phenotype 

Improving opaque-2 maize by introgression of the lysine-rich y-zein. 
Transformed plants described in the previous example, with both a 
resistance to Basta and expressing a lysine-rich y-zein, were used. Their pollen 
20 was used to fertilise opaque-2 maize plants from the W64 Ao2 line which contained 
only a single y-zein gene. This line was obtained from the Maize Stock Center. 
The plants and its Fl descendants were selected for their resistance to Basta and 
then self-fertilised. The F2 grains produced were analysed for the opaque 
phenotype on a light table and opaque or vitreous grains were sown and evaluated 
25 for Basta resistance. In the case when opaque grains are sensitive to Basta, 
introduction of the lysine-enriched y-zein into the plant under consideration has 
been demonstrated to complement the opaque-2 phenotype. 

In these Basta resistant plants, o2/o2 genotype individuals with only one y- 
zein gene on chromosome 7 were selected using molecular probes coding for the 
30 opaque-2 gene and for the y-zein. These latter revealed polymorphic restriction 
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fragments and only individuals with the patron type of the W64o2 line were 
retained (Lopes M. A. et al. ? 1995, Mol. Gen. Genet. 19: 247, 603-613). 

These individuals had a lysine content which was on average equivalent to 
or greater than that of o2 maize. From these individuals, any introgression in 
5 ELITE varieties, with a "high lysine content" character was observed by 
determining the resistance to BASTA and the presence of the o2 aileile detected by 
RFLP. 

5) Expression of lysine-enriched y-zeins in Arabidopsis thaliana 

In order to obtain stable transformation, plasmid constructs P20yZ and 
10 pH30yZ cloned in the Bluescript KS (-) plasmid were inserted in the form of 
HincII/Xbal fragments in the binary vector pBinl9 (Bevan, M. Nucl. Acids Res. 12: 
8711-8721 (1984)), containing the 35S promoter of the cauliflower mosaic virus 
(CaMV) and formation signals for the 3 5 end and polyadenylation of the oct opine 
synthetase gene (ocs). The new plasmids were termed pl9P20yZ and p!9H30yZ 
15 (Figure 12). 

The binary vectors containing the sequences coding for the proteins P20yZ 
and H30yZ (pl9P20yZ and pl9H30yZ) were transferred to the LBA4404 strain of 
Agrobacterium tumefaciens . Ecotype RLD Arabidopsis plants were transformed 
using the method described by Valvekens D., Van Montagu, M and van 

20 Lijsebettens, M. ((1988) Proc. Natl. Acad, Sci. USA 85: 5536-5540). For each 
construct, 10 transgenic plants were screened by immunoblot analysis using an 
antiserum obtained against the y-zein (ccG2 ? Ludevid et al. 1985). The plants 
containing the highest amounts (corresponding to about 0.1% of the total quantity 
of proteins present in Arabidopsis leaves) of the transgenic products in generation 

25 Fl were selected to obtain generation F2. These plants were also selected for 
expression of the desired protein. 

Whole transgenic plants, selected in a medium containing kanamycin, were 
homogenised in liquid nitrogen. The transgenic proteins were selectively extracted 
with a solution containing ethanol/ 0.125 N hydrochloric acid HCl in a proportion 

30 of 3:1 (v/v) with 5% of mercaptoethanol and protease inhibitors. The proteins 
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extracted with this solution were precipitated in 5 volumes of acetone and analysed 
by SDS-PAGE and immunoblotting. The protein extracts from non transgenic 
plants were used as controls. The proteins resulting from insertion of K(P-K)4 
sequences in the y-zein were properly expressed in Arabidopsis thaliana plants 
5 using the constitutive promoter 35S from CaMV. On the immunoblots, antibodies 
aG2 and aPL recognised electrophoresis bands corresponding to proteins P20yZ 
and H30yZ. These bands migrated with apparent molecular weights in accordance 
with those which were previously observed in the in-vitro translation/translocation 
experiments (30 kD and 26 kD respectively). As observed in the transgenic 
10 Arabidopsis plants expressing y-zein (Geli et al, Plant Cell 6: 1911-1922 (1994)), 
the proteins P20yZ and H30yZ migrated in the form of two electrophoresis bands, 
namely the bands corresponding to 36 and 30 kD for P20yZ and the bands 
corresponding to 32 and 26 kD for H30yZ. The higher bands could correspond to 
products which have undergone post-translational modifications. Such a post- 
15 translational modification was not detected in the transformed maize endosperms. 
This result suggests that the modification would appear when these proteins are 
expressed in a heterologous system such as Arabidopsis thaliana . 

6) Expression of recombinant lysine-enriched y-zeins in maize 

Method 

20 After obtaining transgenic plants, they were crossed with a non transformed 

male line. As a result, 50% of the grain harvested in the case of unilocus insertion 
will be transgenic. With the aim of analysing the lysine-enriched y-zeins in the 
transgenic plants, the proteins were extracted from 6 grains per transformant 

The endosperms were dissected by removing the embryos and pericarps 

25 from the grains. The endosperms were ground and 50 mg of flour was used for 
selective extraction. Previously the a-zeins were extracted by three treatments 
with 70% ethanol. After centrifugation, the ethanol was vacuum evaporated. The 
proteins which were insoluble in ethanol (principally the y-zeins and lysine-enriched 
y-zeins) were extracted from the residue with a Laemli buffer containing 10% of 

30 mercaptoethanol (100 jil of buffer per 10 mg of flour). The total proteins were 
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then analysed by staining with silver (Morrissey, J. H. 1981, Ann. Biochem. Vol. 
117, p 307-310). The y-zeins and lysine-enriched y-zeins were analysed by 
immunoblot using aG2 (dilution 1/2000) and aPL (dilution 1/500) antibodies 
respectively. An anti-rabbit antibody antibody conjugated with alkaline 
5 phosphatase was used as the secondary antibody in the immunoblot. The extracts 
were diluted to enable them to be charged onto SDS-PAGE in accordance with the 
analytical method used. 

Results 

Accumulation of lysine-enriched y-zein in transgenic 20yZ and 45yZ maize 
10 plants 

Figure 13 shows an immunoblot of protein extracts revealed with the aPL 
antiserum (A, B) and the total proteins after silver staining (C). As can be seen in 
Figure 13 A, 6 20yZ transgenic plants were tested with the aPL antiserum and the 
lysine-enriched y-zein was expressed in transgenic plants Al and D2 (tracks 1 and 

15 6 respectively). In plant CI (track 4), only traces of lysine-enriched y-zein were 
observed. When the extracts from 6 transgenic 45yZ plants were charged onto gel 
and labelled with ccPL antiserum (Figure 13B), a strong reaction with the antibody 
was observed for transformants Bl and CI (tracks 3 and 4 respectively). 

It should be noted that the reaction with the antibody in the extracts from 

20 endosperms from plants 45yZ Bl and CI was stronger than in the extracts from 
plants 20yZ Al and D2. This result was confirmed after staining the gels with 
silver (Figure 13C) where the 2 types of y-zein: endogenic and lysine-enriched, are 
stained. The lysine-enriched y-zein had an apparent molecular weight of 30 kDa 
and that of the endogenous y-zein was 28 kDa. Expression of lysine-enriched y- 

25 zein was weaker in the 45yZ Bl plant than in the CI (tracks 2 and 3 respectively). 
In tracks 1 to 6 of Figure 13C, an identical dilution of the proteins from endosperm 
extracts of plants 45yZ and 20yZ were deposited on the gel At this dilution, 
expression of the lysine-enriched y-zein was detected only in endosperms from 
plants 45yZ Bl and CI. However, when a larger extract (40 jig of proteins per 

30 track) of 20yZ proteins was charged onto the gel, a faint band (see arrow) 
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corresponding to the lysine-enriched y-zein was detected in endosperms 20yZ Al 
and D2. This result indicates that 45yZ plants accumulate many more proteins of 
the invention than P20yZ plants. This is probably due to the different activities of 
the promoters. 45yZ plants were transformed with a construct containing the y- 

5 zein of the invention under the control of the y-zein promoter (1 .7 kb) while 20yZ 
plants were transformed with the same coding sequence but under the control of 
the CaMV 35S promoter. Silver staining is a general protein staining technique, 
but the strong brown colour is especially observed in the presence of basic 
proteins. Since the a-zeins had been extracted from the flour as described above, 

10 they were absent during the SDS-PAGE analysis of Figure 13C. 

Segregation 

Since there is 50% segregation in all transgenic plants in the case of a single 
locus, grain by grain analysis was carried out to quantify the amount of lysine- 
enriched y-zein in each transformant. The analysis was carried out only with 45yZ 

15 transformants which had a stronger degree of expression. Figure 14 shows the 
silver staining (A) and immunoblot with aPL (B) of 5 different endosperms of 
45yZ Bl and CI. The faint electrophoretic band present on all tracks (see A, 
tracks 1 to 10) correspond to the endogenic y-zein. As can be seen in Figure 14 A, 
2 of the 5 endosperms accumulated lysine-enriched y-zein (see tracks 3, 4 and 6, 

20 7). As a result, about half of the grains accumulated significant quantities of lysine- 
enriched proteins. If the fact that identical quantities of proteins were deposited on 
the gel was taken into account, it is observed that 45yZ CI transformant had 
accumulated more lysine-enriched y-zein than Bl. In fact this result was in 
agreement with that observed for silver staining of extracts mixed with endosperm 

25 (Figure 13 C, track 3). The proof of the presence of lysine-enriched y-zein in these 
endosperms is underlined in Figure 14B. The immunoblot using the aPL antiserum 
shows that 2 endosperm extracts of 45yZ Bl (tracks 3 and 4) and 45yZ CI (tracks 
4 and 6) accumulated lysine-enriched y-zein. To confirm this percentage of 
transgenic grains, 10 new grains of 45yZ CI transformant were analysed by 

30 immunoblot and using aPL antiserum (Figure 15 A). As expected, about half of 
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the transgenic grains were detected. An immunoreactive band was observed in the 
endosperm extracts (tracks 1, 2, 9 and 10). 
Estimation of the quantity of lysine-enriched y-zeins in the endosperms of 

45yZ transformants 

5 aG2 is a polyclonal antiserum which recognises endogenous y-zeins and 

lysine-enriched y-zeins. The reactivity of this antiserum with extracts of 45yZ CI 
endosperms was used to quantify the quantity of lysine-enriched y-zeins of the 
invention in the endosperms of transformed plants. 

Figure 15B shows the immunoblot of 5 protein extracts corresponding to 5 

10 45yZ CI endosperms (tracks 1 to 5). As expected, only 2 endosperms showed an 
immunoreaction profile characteristic of transgenic grains. The upper band of 30 
kDa corresponded to lysine-enriched y-zein (arrows on track 1 and 2) and the 
lower band corresponded to endogenous y-zein. It should be noted that in the 
endosperms of non transgenic plants, the upper band was absent (tracks 3, 4 and 

15 5). Surprisingly, it appears that the amount of endogenous y-zein was lower in the 
transgenic plant extracts than in non transgenic plants (see arrows in tracks 1 and 
5). 

At first sight, it was observed that: 

i) in 45yZ CI transgenic endosperms, the ratio of lysine-enriched y-zein/endogenic 
20 y-zein was 7/3. Thus the quantity of modified protein of the invention was at 

least twice that of the endogenous protein; 

ii) the quantity of endogenic protein in the non transgenic endosperms (see tracks 
3, 4 and 5) was equivalent to that of the lysine-enriched y-zein in the transgenic 
endosperms (see tracks 1 and 2). 

25 7) Expression of recombinant lysine-enriched y-zeins in wheat 

As in Example 6), it is possible to demonstrate the presence of lysine- 
enriched y-zeins of the invention in wheat. 

Wheat can be transformed using the method described by Weeks et al., 
1993, Plant Physiol., vol 102: pages 1077-1084 or using the method described in 
30 EP-A-0 709 462. 



CLAIMS 

An oligonucleotide comprising at least one concatenation coding for a 
polypeptide with formula (P-K) n , where: 

• n is a whole number of 2 or more; 

• P represents a proline amino acid residue; 

• K represents a lysine amino acid residue; 

• the symbol represents a bond between the two amino acid residues, 
in particular a peptide type bond, the n (P-K) units also being bonded 
together by such bonds, for example peptide type bonds. 

An oligonucleotide according to claim 1, comprising a concatenation 
coding for a polypeptide with formula (P-K) n where n is a whole number of 
3 or more, and preferably n is equal to 4, 5, 6, 7, 8, 9, 10 or 15. 
An oligonucleotide according to claim 1 or claim 2, comprising a 
concatenation coding for a polypeptide with formula (P-K)„, in which the 
sequence of n (P-K) units is interrupted by one or more amino acid residues 
other than P or K residues. 

An oligonucleotide according to any one of claims 1 to 3, characterized in 
that the concatenation coding for the polypeptide comprising the n (P-K) 
units is completed at its 5 5 end and/or at its 3' end by one or more codons 
coding, for example, for at least one lysine residue at the N-terminal 
extremity of the formed polypeptide. 

An oligonucleotide according to claim 4, characterized in that it codes for a 
polypeptide with formula (P-K), formula K-(P-K) 4 , or with formula 
2K(P-K) 4 . 

A recombinant nucleotide sequence comprising a concatenation of 
nucleotides coding for a plant protein reserve, characterized in that it further 
comprises an oligonucleotide according to any one of claims 1 to 5, 
inserted at one site of the nucleotide concatenation selected such that: 

• expression of the nucleotide sequence in a particular plant cell enables a 
modified protein reserve to be produced which is localised in that cell in 
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a manner identical to or similar to the normal protein reserve which 
would be expressed in the same cell under the same conditions by the 
corresponding normal coding nucleotide concatenation; and/or 
• the modified protein reserve coded by the recombinant nucleotide 
5 sequence is immunologically recognised by antibodies produced against 

the corresponding normal protein reserve. 
7. A nucleotide sequence according to claim 6, characterized in that the coding 
nucleotide concatenation it comprises codes for a protein reserve which is 
naturally low in lysine. 

10 8. A nucleotide sequence according to claim 7, characterized in that the coding 
nucleotide concatenation it comprises codes for a protein reserve naturally 
produced by a plant for use in animal or human foodstuffs. 

9. A nucleotide sequence according to claim 7 or claim 8, characterized in that 
the coding nucleotide concatenation it comprises codes for a protein 

15 reserve naturally produced by a plant from the cereal family. 

10. A nucleotide sequence according to claim 7 or claim 8, characterized in that 
the coding nucleotide concatenation it comprises codes for a protein reserve 
naturally produced by a plant from the legume or crucifer family. 

11. A nucleotide sequence according to claim 9, characterized in that the coding 
20 nucleotide concatenation it comprises codes for a maize protein reserve. 

12. A nucleotide sequence according to claim 11, characterized in that the coding 
nucleotide concatenation it comprises codes for a protein reserve from the zein 
family. 

13. A nucleotide sequence according to claim 12, characterized in that the coding 
25 nucleotide concatenation it comprises codes for a protein reserve which is 

maize y-zein, 

14. A nucleotide sequence according to claim 13, characterized in that the 
nucleotide concatenation coding for the maize y-zein it contains has the 
sequence shown in Figure 9. 
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15. A nucleotide sequence according to claim 7 or claim 8, characterized in that the 
coding nucleotide concatenation it comprises codes for a protein reserve of a 
plant selected from the following: soya, sunflower, tobacco, wheat, oats, 
alfalfa, rice, oilseed rape, sorghum, and Arabidopsis thaliana. 
5 16. A nucleotide sequence according to claim 13 or claim 14, characterized in that 
the oligonucleotide is inserted in place of or following the Pro-X domain or in 
the Pro-X domain naturally present in the maize y-zein, 

17. A recombinant nucleotide sequence, characterized in that it comprises a 
nucleotide sequence according to any one of claims 1 to 16 under the control of 

10 an expression promoter. 

18. A recombinant nucleotide sequence according to claim 1 7, characterized in that 
the promoter is a specific promoter for a given cell tissue, for example a 
promoter which is specific for expression in grains, and/or in the leaves of 
plants. 

15 19. A nucleotide sequence according to claim 17, characterized in that the 
expression promoter is that of maize y-zein. 

20. A nucleotide sequence according to claim 17, characterized in that the 
expression promoter is the promoter CaMV35S. 

21. A nucleotide sequence according to any one of claims 16 to 20, characterized 
20 in that it codes for one of the polypeptides P20yZ or H45yZ with the sequences 

shown in Figures 1 1 and 10 respectively. 

22. A cloning and/or expression vector, characterized in that it comprises, at a site 
which is not essential for replication, a nucleotide sequence in accordance with 
any one of claims 1 to 20. 

25 23. A cloning and/or expression vector, characterized in that it is one of plasmids 
pP20yZ (CNCM N° 1-1640), pffiOyZ or pH45yZ (CNCM N° 1-1639). 

24. A polypeptide coded by a sequence according to any one of claims 1 to 17. 

25. A lysine-enriched modified maize y-zein, characterized in that it is coded by a 
nucleotide sequence according to claim 13 or claim 14. 
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26. A lysine-enriched modified maize y-zein, characterized in that its amino acid 
sequence is modified by at least one polypeptide with formula (P-K) n or 
with formula 2K(P-K) n , where: 

• n is a whole number of 2 or more; 

5 • P represents a proline amino acid residue; 

• K represents a lysine amino acid residue; 

• the symbol represents a bond between the two amino acid residues, 
in particular a peptide type bond, the n (P-K) units being bonded 
together by bonds, in particular peptide type bonds, said polypeptide 

10 having formula (P-K) n or K-(P-K) n being substituted for a sequence 

naturally present in the normal maize y-zein or being inserted with 
deletion of one or more amino acids of the amino acid sequence for 
normal maize y-zein, or being added to the normal y-zein amino acid 
sequence, the insertion site for the polypeptide being selected such that: 

15 • when the modified ly sine-rich y-zein is produced in a host cell, in 

particular in a plant cell, it is localised in identical or similar manner to 
the normal maize y-zein which would be produced under the same 
conditions in the same host cell; and/or 

• the modified maize y-zein is recognised by antibodies directed against 
20 the normal maize y-zein. 

27. A modified maize y-zein according to claim 26, characterized in that it is 
the protein P20yZ or the protein H30yZ or the protein H45yZ. 

28. A recombinant host cell, characterized in that it comprises a nucleotide 
sequence according to any one of claims 1 to 17. 

25 29. A host cell according to claim 28, characterized in that it is a bacterium, for 
example E. coli or Agrobacterium tumefaciens . 

30. A host cell according to claim 28, characterized in that it is a plant cell. 

31. A host cell according to claim 30, characterized in that it is a plant seed 
cell 
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32. A host cell according to claim 3L characterized in that it is a cell from 
maize seed endosperm. 

33. A host cell according to claim 32, characterized in that it contains a 
nucleotide sequence according to claim 13, integrated in its genome in a 

5 stable manner. 

34. A host cell according to claim 32, characterized in that it produces a lysine- 
enriched modified maize y-zein according to claim 26 or claim 27. 

35. A host cell according to claim 30, characterized in that it is a soya, 
sunflower, tobacco, wheat, oats, alfalfa, rice, oilseed rape, sorghum or 

10 Arabidopsis cell 

36. Seeds producing a polypeptide according to any one of claims 24 to 27. 

37. A plant producing a polypeptide according to any one of claims 24 to 27. 

38. A plant according to claim 37, characterized in that it is a maize plant. 

39. Seeds obtained from plants according to claim 37 or claim 38. 

15 40. A method of producing plants or seeds expressing a modified protein 
reserve, characterized in that it comprises the steps of: 

a) transforming a plant cell with a nucleotide sequence according to any 
one of claims 6 to 17, or a vector according to claim 22 or claim 23, 
under conditions enabling the modified protein reserve coded by the 

20 nucleotide sequence to be expressed in a stable and functional manner; 

b) regenerating plants from the plant cell transformed in step a), to obtain 
plants expressing the modified protein reserve; 

c) if necessary, obtaining seeds from the modified plants obtained in step 
b). 

25 41. A method according to claim 40, characterized in that the plant is maize 
and the protein reserve is y-zein. 
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